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(57) Abstract 

Methods and DNA constructs are provided for detection and manipulation of a target eucaryotic gene whose expression is restricted 
to certain tissues or specialized cell types. The methods include transforming a cell with a first indicator component under the control of a 
promoter selected for its restricted expression in a particular cell or tissue. The cell is also transformed with a gene trap vector encoding a 
second indicator component The cell is allowed to differentiate to produce a specialized cell or tissue which is monitored for expression 
of both the first and second indicator components, thereby detecting a gene into which the trap vector has integrated which is expressed in 
the same cell or tissue type as the selected promoter. 
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COMPLEMENTATION TRAP 
Field of the Invention 

5 

The present invention relates to gene entrapment 
vectors and their use in gene discovery, and their use in 
screening for or making cells and organisms that are 
mutated for such genes. This invention also relates to the 

10 use of such entrapment vectors to identify tissue specific 
transcription control elements such as promoters and 
enhancers and for generating transgenic animals displaying 
restricted expression of transgenes. This invention also 
relates to trap vectors comprising a splice acceptor and a 

15 sequence encoding a reporter gene. 

Background of the Invention 

Genomic based drug discovery is largely dependent upon 
20 the identification of specific genomic targets. Thus, 
cloning, sequencing, and identification of function of 
mammalian genes is a first priority in a genomic based drug 
discovery. In particular, it is important to identify and 
make use of genes which are spacially and/or temporally 
25 regulated in the organism. 

Animal model systems such as the fruit fly and the 
worm are often used in gene identification because of ease 
of manipulation of the genome and ability screen for 
30 mutants. While these systems have their limitations, large 
numbers of developmental mutations have been identified in 
those organisms either by monitoring the phenotypic effects 
of mutations or by screening for expression of reporter 
genes incorporated into developmentally regulated genes. 

35 

Many features of the mouse make it the best animal 
model system to study gene function. However, the mouse 
has not been used for large scale classical genetic 
mutational analysis because random mutational screening and 
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analysis is very cumbersome and expensive due to long 
generation times and maintenance costs. 

A disadvantage in using animal models for the 
5 identification of genes is the need to establish a 
transgenic animal line for each mutational event. This 
disadvantage is alleviated in part by using embryonic stem 
(ES) cell lines because mutational events may be screened 
in vitro prior to generating an animal. ES cells are 

10 totipotent cells isolated from the inner cell mass of the 
blastocyst. Methods are well known for obtaining ES cells, 
incorporating genetic material into ES cells, and promotion 
of differentiation of ES cells. ES cells may be caused to 
differentiate in vitro or the cells may be incorporated 

15 into a developing blastocyst in which the ES cells will 
contribute to all differentiated tissues of the resulting 
animal. Vectors for transforming ES cells and suitable 
genes for use as reporters and selectors are also well 
known. 

20 

Gene entrapment strategies have been employed to 
identify developmentally regulated genes. One type of 
entrapment vector is called a "promoter trap", which 
consists of a reporter gene sequence lacking a promoter. 

25 Its integration is detected when the reporter is integrated 
"in-frame" into an exon. "Gene trap vectors" target the 
much more prevalent introns of the eucaryotic genome. The 
latter vectors consist of a splice-acceptor site upstream 
from a reporter gene. Integration of the reporter into an 

3 0 intron results in a fusion transcript containing RNA from 
the endogenous gene and from the reporter gene sequence. 

Gene trap vectors may be made more efficient by 
incorporation of an internal ribosomal entry site (IRES) 
35 such as that derived from the 5' non-translated region of 
encephalomyocarditis virus (EMCV) . Placement of a IRES 
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site between the splice acceptor and the reporter gene of 
a gene trap vector means that reporter gene product need 
not be translated as a fusion product with the endogenous 
gene product, thereby increasing the likelihood that 
5 integration of the vector will result in expression of the 
reporter gene product. 

Examples from the literature of the use of promoter 
and gene trap vectors as well as such vectors including an 
10 IRES sequence, are listed below. Some examples involve the 
identification of developmentally regulated or tissue 
specific events making use of ES cell lines. 

1. Canadian Patent application no. 2,166,850 (open for 
15 public inspection July 11, 1996) Vectors and the Use 

Thereof for Capturing Target Genes : describes the use of 
transmembrane sequence encoding gene trap vectors to 
isolate and identify secretory proteins. Also see United 
States Patent No. 5,767,336 issued June 16, 1998. 

20 

2. US Patent 5,364,783 issued Nov. 15, 1994. Retrovirus 
Promoter Trap Vectors : describes retroviral vectors that 
are used to isolate transcriptionally active chromosomal 

25 regions and to identify promoter sequences. The reporter 
gene is placed in the U3 or U5 control region of the 
retrovirus . 

2. Gossler, A., et al. (1989). Mouse Embryonic Stem 
30 Cells and Reporter Constructs to Detect Developmentally 
Regulated Genes . Science 244:463-465: describes the use 
of enhancer trap gene trap vectors for use in identifying 
developmentally regulated genes. The gene trap vector 
consists of the mouse En- 2 splice acceptor upstream from 
35 lacZ (reporter) and a selector gene (hBa-neo) . 
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Summary of Invention 

This invention makes use of known genes whose 
expression is restricted to specific tissue, tissues or 
5 specialized cells ("restricted expression") to facilitate 
identification and manipulation of new genes and their 
associated transcription control elements which have 
similar patterns of expression. 

10 Accordingly, this invention provides a method of 

detecting a target gene having restricted expression in a 
eucaryotic organism, which comprises the steps of: 

(i) transforming a eucaryotic cell with a DNA 
15 sequence encoding a first indicator component 

under the control of a promoter having restricted 
expression; 

(ii) transforming the cell of (i) or a descendent of 
20 the cell of step (i) , by operably integrating 

into the cell's genome DNA lacking a promoter but 
which comprises a sequence encoding a second 
indicator component; 

25 (iii) producing tissue or specialized cells from the 

cell of (ii) ; and 

(iv) monitoring the tissue or specialized cells of 
(iii) for a detectable indicator resulting from 
3 0 both the first and second indicator components. 

This invention also provides a method of obtaining a 
gene, a part of a gene, transcription control element or 
other nucleotide sequence, having restricted expression 
3 5 which includes isolating endogenous DNA flanking of the 
sequence encoding the second indicator component from a 



WO 99/02719 



- 11 - 



PCT/CA98/00677 



cell or descendent of a cell prepared as described above in 
which the detectable indicator is found. This invention 
also provides a method of modifying the gene, 
transcriptional control element or other nucleotide 
5 sequence obtained as described above which includes 
modification of the endogenous DNA which flanks the coding 
sequence of the second indicator component . 

This invention also provides a method of providing a 
10 eucaryotic organism producing a detectable indicator in a 
specialized cell or tissue of the organism which includes 
growing a mult i -cellular eucaryotic organism from a cell or 
cell descendent from a cell prepared as described above, in 
which the detectable indicator is found. The organism may 
15 also express in the specialized cell or tissue, the product 
of heterologous DNA expressed with DNA flanking of the 
second indicator component of the detectable indicator. 

This invention makes use of a gene entrapment strategy 

20 whereby a detectable indictor is detected when more than 
one component of the indicator is expressed in the same 
cell. One method of this invention involves stable and 
operable insertion of a nucleic acid construct which 
includes one indicator component of the complementation 

25 trap under the control of a known promoter having 
restricted expression into the genome of the cell. This is 
followed by further transformation of the cell, or a cell 
derived from the previously transformed cell, by insertion 
into the cell's genome of a trap vector which includes a 

3 0 DNA which encodes a second indicator component. Insertion 
of the trap vector into an endogenous gene may result in 
the transcription of both the endogenous gene and the 
second indicator component under control of the endogenous 
promoter and subsequent translation of the second 

35 component. If the trap vector integrates into an 
endogenous gene which is expressed in the same cell or 
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tissue type as the first indicator component, a indicator 
resulting from the combination of the first and second 
indicator components may be detected. Thus, this invention 
allows one to distinguish target eukaryotic genes which are 
5 expressed in the same cell or tissue type as the promoter 
controlling the first component of the entrapment system. 
This system, as embodied by this invention is termed a 
"complementation trap". 

10 This invention provides a DNA construct 

comprising a splice acceptor upstream of a sequence 
encoding a component of a detectable indicator according to 
this invention. The component may be, for example, an 
enzyme fragment or subunit, or an enzyme functional in a 

15 pathway leading to production of a detectable indicator. 
This invention also provides a DNA construct comprising a 
promoter having restricted expression upstream of a 
sequence encoding a component of a detectable indicator as 
described above. 

20 

This invention also provides a first DNA construct 
comprising a splice acceptor upstream of a sequence 
encoding a first peptide selected from the group consisting 
of an alpha peptide and an omega peptide of /?-galactosidase 

25 (/3-gal) , wherein the first peptide lacks j3-gal activity but 
is capable of alpha complementation to produce active 
0-gal. This invention also provides a eucaryotic cell and 
a non-human, multi -cellular eucaryotic organism having 
operably and stably incorporated into its genome, the first 

3 0 DNA construct described above. This invention also 
provides a kit including the first DNA construct and, a 
second DNA construct comprising a promoter having 
restricted expression upstream of a sequence encoding a 
second /3-gal peptide selected such that the second peptide 

3 5 is capable of alpha complementation with the first peptide. 
This invention also provides a eucaryotic cell and a 
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non-human, multi-cellular eucaryotic organism having 
operably and stably incorporated into its genome the first 
DNA construct and which also expresses the second DNA 
construct . 

5 

Trap vectors useful for this invention, including the 
above -described first DNA construct, may also contain 
additional elements such as an internal ribosome entry 
sequence (IRES) upstream of the second indicator component 
10 (eg. the first peptide) or, sequences that allow for 
directional integration of a heterologous gene, such as a 
recognition site for site specific recombination (eg. lox) . 

This invention may be used to identify tissue or cell 
15 type specific genes. For example, a nucleic acid construct 
containing the coding sequence of a first indicator 
component under control of a known tissue specific promoter 
is "seeded" into cells which may be made to differentiate 
or whose nuclear material may be incorporated into cells 
20 which are capable of differentiation. The second component 
is then introduced into a cell having the "seeded" 
construct, by means of the trap vector. The resulting 
cells, or cells derived from those cells (eg. progeny), are 
then induced to differentiate (for example by addition or 
25 withdrawal of a chemical inducer/repressor) or allowed to 
spontaneously differentiate. The cells are then screened 
for activity of the indicator which will occur in the cell 
or tissue type in which the promoter is functional. 

3 0 This invention is particularly useful for screening 

cell or tissue specific genes or transcriptional control 
elements such as promoters and enhancers, in any animal 
from which embryonic stem (ES) cell lines may be obtained. 
The ES cell is subjected to the complementation trap 

35 process described above. The ES cells are then allowed to 
differentiate in vitro and cells are screened for the 
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indicator. ES cells demonstrating expression of the 
indicator may be introduced into a blastocyst to produce an 
animal which will exhibit the same pattern of expression. 
Where introduction of the trap vector results in disruption 
of the target gene, the animal resulting from the mutated 
ES cells will be useful as a "knock-out" animal useful for 
study of the target gene. Alternatively, location of the 
trap vector into the endogenous gene will provide means for 
subsequent disruption of the target gene for production of 
"knock-out" animals. The trap vector of this invention may 
be used as a means for locating, cloning, sequencing, and 
further mutation of the target gene or promoter or enhancer 
sequences associated with the target gene. 

This invention will make new transcriptional control 
elements available for use in making new transgenic animals 
displaying cell, tissue or organelle specific expression of 
transgenes. Promoters made available by use of this 
invention may themselves be made to control the first 
indicator component in the complementation trap of this 
invention in a "leap- frog" procedure whereby new genes 
having the same restriction pattern as the promoter or may 
be located. 

25 Brief Description of Drawings 

Figure 1: is a schematic illustrating a DNA construct 
useful in this invention comprising the myeloid cell 
specific promoter CDllb controlling the sequence encoding 
30 a /?-galactosidase alpha peptide terminated by the human 
growth hormone poly- adenylat ion signal. Downstream is a 
selection cassette, which in this case confers resistance 
to hygromycin driven by the phosphoglycerate kinase 
promoter. 
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WE CLAIM : 



1. A method of detecting a target gene having restricted 
expression in a eucaryotic organism, which comprises the 
steps of : 

(i) transforming a eucaryotic cell with a DNA 
sequence encoding a first indicator component under the 
control of a promoter having restricted expression; 

(ii) transforming the cell of (i) or a descendent of 
the cell by operably integrating into the cell's genome, 
DNA lacking a promoter but which comprises a sequence 
encoding a second indicator component; 

(iii) producing tissue or specialized cells from 
the cell of (ii) ; and 

(iv) monitoring the tissue or specialized cells of 
(iii) for a detectable indicator resulting from both the 
first and second indicator components. 

2. The method claim 1 wherein the eucaryotic cell is an 
ES cell. 



3. The method of claims 1 and 2 wherein the first and 
second indicator components are inactive fragments or 
subunits of an enzyme which, when combined, provide an 
active enzyme detectable by its activity. 

4. The method of claim 1 or 2 wherein the first and 
second indicator components are independently detectable or 
selectable, and the detectable indicator is the presence of 
both indicator components in a cell. 

5. The method of claim 1 or 2 wherein the first and 
second indicator components react in a sequence of 
reactions which result in a detectable indicator. 
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6. The method of any one of claims 1-5 which comprises 
the additional step of isolating DNA endogenous to the 
eucaryotic cell which flanks integrated DNA comprising the 
second indicator component . 

7. A method of producing an eucaryotic organism 
comprising a detectable indicator associated with a target 
gene having restricted expression, comprising steps (i) and 
(ii) of claim 1 to produce a transformed cell, followed by 
the step of growing an eucaryotic organism from said 
transformed cell. 

8. The method of claim 7 wherein the eucaryotic organism 
is a non-human mammal and the transformed cell is an ES 
cell. 

9. A DNA construct comprising, in a 5' to 3' direction, 
a splice acceptor and a sequence encoding an inactive 
subunit or fragment of an enzyme, wherein said subunit or 
fragment is active when combined with a further subunit or 
fragment of an enzyme. 

10. The DNA construct of claim 9 wherein the sequence 
encodes an alpha or omega fragment of j3-galactosidase . 

11. The DNA construct of claim 9 or 10 further comprising 
an IRES. 

12. The combination of: 

(i) a DNA construct for integration into the genome 
of an eucaryotic cell comprising a sequence encoding a 
first indicator component under the control of a promoter 
having restricted expression; and 

(ii) A DNA construct for integration into the genome 
of a eucaryotic cell, comprising in the 5' to 3' direction, 
a splice acceptor and a sequence encoding a second 
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indicator component and an optional IRES, 
wherein expression of both the first and second indicator 
components in said cell is detectable. 

13 . A eucaryotic cells transformed by the combination of 
DNA constructs of claim 12. 

14. The cell of claim 13 which is an ES cell. 
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